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Abstract. In this paper, we consider the product-limit quantile estimator of an unknown 
quantile function under a censored dependent model. This is a parallel problem to the 
estimation of the unknown distribution function by the product-limit estimator under the 
same model. Simultaneous strong Gaussian approximations of the product-limit process 
and product-limit quantile process are constructed with rate 0((log n)~^) for some A > 0,. 
The strong Gaussian approximation of the product-limit process is then applied to derive 
the laws of the iterated logarithm for product-limit process. 

1. Introduction and Preliminaries 

In medical follow-up or in engineering life testing studies, the life time variable may 
not be observable. Let Xi, . . . , X„ be a sequence of life times, having a common unknown 
continuous marginal distribution function F, with a density function f = F' and hazard 
rate X = f/{l — F). The random variables are not assumed to be mutually independent 
(see Assumption (1) for the kind of dependence stipulated). Let the random variable Xi 
be censored on the right by the random variable Yi, so that one observes only 

Zi = XiAYi and 6i = IiXi<Yi), 

where A denotes minimum and /(.) is the indicator of the event specified in parentheses. In 
this random censorship model, we assume that the censoring random variables Yi , . . . , 1^ 
are not mutually independent (see Assumption (2) for the kind of dependence stipulated), 
having a common unknown continuous d.f. G, and that they are independent of the Xi^s. 
Since censored data traditionally occur in lifetime analysis, we assume that Xi and Y^ are 
nonnegative. The actually observed Zj's have a distribution function H satisfying 

H{t) = l-H{t) = {l-Fmi-G(,t)). 

Denote by 

F4t) = P{Z <t,6 = l), 
the sub-distribution function for the uncensored observations. Define 

n n 

Nn{t) = Y,I{Zi<t,5 = l)=Y, HXi < t A Y), 

i=l i=l 
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the number of uncensored observations less than or equal to t, and 

n 

Yn{t) = Y,I{Zi>t), 

i=l 

the number of censored or uncensored observations greater than or equal to t and also the 
empirical distribution functions of H{t) and F^{t) are respectively defined as 

F„(t) = n-^Y^{t) , iV„(i) = n-^N^{t). 

Then the Kaplan-Meier estimator for 1 — F{t), based on the censored data is 

(1-1) ^-^n{t)=ll{l-^^), t<Z^^^, 



s<t 



where Z^j) are the order statistics of Zj and dNn{t) = Nn{t) — Nn{t—). 

As is known (see, e.g., Gill, 1980), for a d.f. F on [0, oo), the cumulative hazard function 
A is defined by 

and A(t) = —log{l — F{t)) for the case that F is continuous. The empirical cumulative 
hazard function A„(t) is given by 

which is called the Nelson- Aalen estimator of A[t) in the literature. 

For a censored model with {Xi;i > 1} and {Yf,i > 1} independent and identically dis- 
tributed (i.i.d) sequences and mutually independent. Burke et al. (1981, 1988), established 
strong Gaussian approximation of the product-limit(PL) process \/n{Fn{t) — F{t)) by a 
two parameter Gaussian process at the almost sure rate of 0(n~^/^(log n)^). In left trun- 
cation and right censorship(LTRC) model, Zhou and Yip (1999) initiated and Tse (2003, 
2005) established strong Gaussian approximation of the PL-process by a two-parameter 
Gaussian process at the almost sure rate of 0((log n)^/^n~^/^), a rate that reflects the 
two-dimensional nature of the LTRC model. 

The quantile function Q and its sample estimator are defined by 

Q{p) := inf{t : F{t) > p}, Q„(p) := inf{t : F„(t) > p} 

for < p < 1. The role of the quantile function in statistical data modeling was emphasized 
by Parzen (1979). In econometrics, Gastwirth (1971) used the quantile function to give a 
succinct definition of the Lorenz curve, which measure inequality in distribution of resources 
and in size distribution. 

In the independent framework with no censoring, the properties of estimator Qn (where 
Fn is replaced by the empirical d.f. F„) have been extensively studied (see e.g., Csorgo, 
1983; Shorack and Wellner, 1986). Under 0-mixing condition (for the definition see 
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Doukhan, 1996), the Bahadur representation was obtained by Sen (1972) and the ex- 
tension to the a-mixing case was obtained by Yoshihara (1995). Under a-mixing con- 
dition(see definition below), the strong approximation of the normed PL-quantile pro- 
cess Pn{p) '■= V^fiQip))[Qip) ^ Qnip)] by a two parameter Guassian process at the rate 
0((logn)^^) for some A > 0, was obtained by Fotopoulos et al.(1994) and was later im- 
proved by Yu (1996). 

For a censored model with Xi and Y^'s, independent and identically distributed sequences 
and mutually independent, Padgett and Thombs (1989) stated the strong consistency and 
asymptotic normality for a smooth estimator of Q{p). Sander (1975) obtained some as- 
ymptotic properties, and Csorgo (1983) and Cheng (1984) discussed strong approximation 
results with some applications for Qn{p)- In left truncation and right censorship model, 
Tse (2005), obtained strong Gaussian approximations of the PL-quantile process by a two 
parameter Kifere type process at the rate 0((log n)^/^^-^/^). Ould-Said and Sadki (2005) 
established the strong consistency and a Badadur-type representation of K-M quantile 
function Qni-) under a strong mixing hypothesis. 

The main aim of this paper is to derive strong Gaussian approximations of the PL- 
process and PL-quantile process, for the case in which the underling lifetime are assumed 
to be a-mixing whose definition is given below. As a result, we obtain the Law of the 
iterated logarithm for PL-process. 

For easy reference, let us recall the following definition. 

Definition 1. Let {Xi,i > 1} denote a sequence of random variables. Given a positive 
integer m, set 

(1.2) a(m) = sup{|P(^ ^^ B) - P{A)P{B)\ ; A G r^B G J",-^}, 

fe>i 

where denote the a -field of events generated by {Xj;i < j < k}. The sequence is said 
to be a-mixing {strongly mixing) if the mixing coefficient a{m) — ^ as m — ^ oo. 

Among various mixing conditions used in the literature, a-mixing. is reasonably weak 
and has many practical applications. There exists many processes and time series fulfilling 
the strong mixing condition. In particular, the stationary autoregressive-moving average 
(ARMA) processes, which are widely applied in time series analysis, are a-mixing with 
exponential mixing coefficient, i.e., a(n) = e"'^" for some v > 0. The threshold models, 
the EXPAR models(see Ozaki, 1979), the simple ARCH models(see Engle, 1982; Masry 
and Tjostheim, 1995, 1997) and their extensions (see Diebolt and Guegan, 1993) and the 
bilinear Markovian models are geometrically strongly mixing under some general ergodicity 
conditions. Auestad and Tjostheim (1990) provided excellent discussions on the role of a- 
mixing for model identification in nonlinear time series analysis. 

Now, for the sake of simplicity, the assumptions used in this paper are as follows. 

Assumptions. 

(1) Suppose that {Xi,i > 1} is a sequence of stationary a-mixing random variables 
with continuous distribution function F and mixing coefficient ai{n). 
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(2) Suppose that {Yi, z > 1} is a sequence of stationary a-mixing random variables with 
continuous distribution function G and and mixing coefficient a2{n). Moreover, we 
assume the censoring times are independent of {Xi,i > 1}. 

(3) a{n) = 0(e^ for some ^ > 0, with a(n) = max{ai{n) , a2{n)) (see Remark 
2.1. in Ould-SaYd and Sadki (2005)). 

Cai(Lemma 1, 1998) showed that the Zj's are a-mixing random variables (with appro- 
priate coefficient a). 

The layout of the paper is as follows. Section [21 contains main results. The proofs of 
the main results are relegated to Section 3. 

2. Main Results 

In the first theorem, we construct a two parameter mean zero Gaussian process that 
strongly uniformly approximate the empirical processes Zni{t) = ^/n{An{t) — A{t)) and 

Zn2{t) = ^(Fnit) - F{t)). 

Theorem 1. Suppose that Assumptions (l)-(3) is satisfied. On a rich probability space, 
there exists a two parameter mean zero Gaussian process {B(u,v) u,v >0} such that, 

(2.1) sup\Zniit)-B{t,n)\=0{ilogn)-^) a.s., 

(2.2) sup \Zn2{t) - (1 - F{t))B{t, n)\ = 0{{logn)-^) a.s., 
for A > 0. 

Remark 1. In the a-mixing case, we can not achieve the same rate as in the iid case 
i.e. 0(n"i/2(logn)2)(see Burke et a/.(1988), Theorem l) . The main reason is that our 
approach utilizes the strong approximation introduced by Dhompongsa{1984:) as a kiefer 
process with a negligible reminder term of order 0(n~^/^(log n)^'*'). This is not as sharp as 
in iid case. 

Corollary 1. Under assumptions (1) — (3), we have, 

(2.3) sup|A„(t) 

(2.4) sup|F„(t) 

t&R 

In the next theorem, we construct a two parameter mean zero Gaussian process that 
strongly uniformly approximate the empirical process Pn{p)- 

Theorem 2. Let < po ^ Pi < 1- Under assumptions (l)-(3), assume that F is Lipschtiz 
continuous and that F is twice continuously differentiable on [Q{pq) — 6, Q(pi) +<^] for some 



m\ = o 
m\ = o 



log log n 



n 



log log n 



n 



1/2 



1/2 



a.s., 



a.s. 
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5 > such that f is bounded away from zero. Then there exists a two parameter mean zero 
Gaussian process B{t,u) for t,u >0 such that, 

(2.5) sup \pn{p)-{l-p)B{Q{p),n)\=Oi{logn)-^) a.s., 

Po<p<pi 

with A > 0. 



3. Proofs 

In order to prove Theorem [H we need the followmg lemma. 

Lemma 1. (Theorem 3 in Dhompongsa 1984)- Under assumptions (1) and (3), there 
exists a Kiefer process {Er(s,t), s G > 0} with covariance function 

E[K{s, t)K{s', t')] = r(s, s') min(t, t') 

and T{s,s') is defined by 

oo 

T{s,s') = Cov{gi{s),gi{s'))+Y,[Cov{gi{s),gk{s')) + Cov{gi{s'),gu{s))l 

fc=2 

where gk{s) = I{Zk < s) — H{s), such that, for some A > depending only on v, given in 
assumption (3), 

sup|F„(t) - ll{t) - K{t,n)/n\ = 0{bn), a.s. 

where 

bn = n-'/\logn)-\ 
Proof of Theorem [TJ We start with the usual decomposition of 

Zni{t) = MMt) - Ht)] = ^[ ^ ' *^ ' 



where 



Yn{x) Jo H{x) 

' y/^\H{x)-Yn{x)] ^^, 

— 2 dF^[x) + Rnlit) 

{H{x)f 

Jo Yn{x){H{x)) Jo H{x) 

= h + h + h. 

Define, for t > the sequence of Gaussian processes 

(3.1) B(t.n)= f'S^S^dF.i.). 

Jo [.H(x)Y 
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where K{s,t) is the Kiefer process in Lemma[TJ Clearly, E{B{t,n)) = 0, 

Cov[B{s,m),B{t,n)] = .[^ f -^i^l^dF,{x)dF,{y). 

\ n Jo Jo H{x) H{y) 

where T{s,t) is defined in Lemma[TJ Let 

(3{t, n) = - V^(F„(t) - H{t)) - Kit, n)/V^. 

Theorem [1] is about the order 

(3.2) sup|Z„i(t)-B(t,n)| =sup|i?„i(t)+i?„2(i)l, 

t>0 t>0 

where 

Rn2{t) = / -f^j^dF^ix). 

Jo [H{x)Y 

To deal with Rni{t), we treat Yn{x) as an empirical d.f. associated to Z'-s and from 
Theorem 1 in Cai (1998), we have 

(3.3) h = 0{al) a.s., 
where 

log log ^^'^ 



n 

To estimates of I2, divide the interval [0, t] into subintervals [xj, Xj+i], i = 1, . . . , /c„ where 
kn = 0{a~^), and = xi < X2 <, • • • < Xk„+i = t are such that Xj+i — xi = 0{bn)- Then 



d[Nn{x) - F,{x)] 



k„+l 



d[Nn{x) - F,{x)] 



H{x) 



The integral on the right hand side of the latter inequality is bounded above by (log n)^'^"^/ -^/n, 
almost surely(The proof of this can be done using similar arguments to A in Lemma 3.4, 



H{x) 

side ol 
lis can 

in Ould-Said et al. 2005). Therefore 

(3.4) /2 = 0((logn)-^-'') a.s. 

The estimate of is similar to the estimate of in the proof of Theorem 2 in Cai(1998). 
Hence, 

(3.5) h = 0{hn) a.s. 
Therefore, by combining ()3.3p - ()3.5p . we have 

(3.6) sup|i?„i(t)| = 0((logn)-^-^) a.s. 

Next, by applying Lemma [H we have 

(3.7) sup|i?„2(t)| =0((logn)-^) a.s. 
Combining p.2p . (j3.6p and (j3.7p we obtain (j2.ip . It can be shown that 

(3.8) F„(t)_i7(t) = (l-F(t))[A„(t)-A(t)]+0(i^^^) a.s. 

n 
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Therefore ([22]) is proved via dSSD- □ 

Proof of Corollary [TJ By the law of the iterated logarithm for Kiefer processes (see, 
e.g., Corollary 1.15.1, page 81, in Csorgo and Revesz, 1981), and ()3.ip we have, 

(3.9) sup \B{t, n)\ < Csup \K{t, n)\/^/^ = O 

where C is a positive constant. From ()2.ip . (j2.2p and ()3.9p we obtain the results. □ 

The proof of Theorem [2l is mainly based on the following Lemmas of Ould-Said et. al. 
(2005). Lemma 2 shows that Fn composed with Qn is an approximate identity up to order 
0{bn)- Lemmas 3 and 4 give global and local bounds for the deviation between Qn and Q. 

Lemma 2. Let < po <pi < 1- Under assumptions (l)-(3), assuming that F is continu- 
ous, then 

sup \Fn{Qn{p))-p\=0{hn) a.S. 
PO<P<Pl 

where bn = {logn)~''^ /^/n with A > 0. 

Lemma 3. Under assumptions (l)-(3), assuming that F' = f is continuous and strictly 
positive on [Q{po) — S,Q{pi) + 5] for some 5 > 0. Then, 

sup \/n\Qn{p) - Q{p)\ = 0(\/loglogn) a.s. 

P0<P<P1 

Lemma 4. Let An = const.bn- Under (l)-(3) and assuming that F is Lipschitz continuous 
on [0, r] . Then, 

sup V^\Zn2{s) - Zn2{t)\ = 0{{logn)-^-^) U.S., 

\s-t\<Xn 

where (3 > and A > 0. 

Proof of Theorem [2l We continue to use the notation A^ as in Lemma [H Let 
s = Qn{p) and t = Q{p), Po < p < Pi, Lemma [3] yields -v/n|s — t\ = 0(Vlog logn). 
Applying Lemma S] gives, 

(3.10) F„(Q„(p)) - Fn{Q{p)) = F{Qn{p)) - F{Q{p)) + 0((logn)-^-V^/^) a.s. 

By Lemma O Fn{Qn{p)) can be replaced by p up to 0(bn). For the right hand side, a 
Taylor expansion of the first term about Q{p) up to second order term gives, 

f{Q{p))[Q(V) - Q{P)] + 0{[Qn{p) - Q{p)f ) + 0((logn)-^-VV^) a.s., 
for pq < p < Pi. Invoking Lemma [3] and rearranging terms in (j3.10p . we have, 

V^f{Qip))[Qn{p)-Q{p)] = Mp - FniQip))] + Oiilogn)-"-^) a.s., 

for Pq < p < Pi. Since F is continuous, F{Q{p)) = p. Recalling the definitions of the PL 
process Z„ and PL-quantile process pn, we have, 

(3.11) pn{p) = Zn{Q{p)) + 0{{logn)-''-^/V^) a.s. 



/log log n\ 
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for po < p < Pi- By using Theorem [T] and ()3.1ip . theorem is proved. □ 
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